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§ : ABSTRACT 

o: 

^ \ A number of recent numerical investigations concluded that the remnants of 

^ I rare structures formed at very high redshift, such as the very first stars and bright 

\ redshift z k, Q QSOs, are preferentially located at the center of the most massive 

CsJ \ galaxy clusters at redshift 2; = 0. In this paper we readdress this question using 

a combination of cosmological simulations of structure formation and extended 

Press-Schechter formalism and we show that the typical remnants of Population 

_ III stars are instead more likely to be found in a group environment, that is in 

H ■ dark matter halos of mass < 2 • 10^^ /i^^M©. Similarly, the descendants of the 

j^ I brightest z ^ 6 QSOs are expected to be in medium-sized clusters (mass of a few 

lO"*^^ h'^yio), rather than in the most massive superclusters (M > lO"*^^ h'^yio) 

^~~^ • found within the typical 1 Gpc^ cosmic volume where a bright z ^ 6 QSO lives. 

pq ' The origin of past claims that the most massive clusters preferentially host these 

^ ■ remnants is rooted in the numerical method used to initialize their numerical 

m ■ 

cn ' simulations: Only a small region of the cosmological volume of interest was 

t"^~- ' simulated with sufficient resolution to identify low-mass halos at early times, 

^yQ • and this region was chosen to host the most massive halo in the cosmological 

O ■ volume at late times. The conclusion that the earliest structures formed in the 

^ ' entire cosmological volume evolve into the most massive halo at late times was 

/\^ • thus arrived at by construction. We demonstrate that, to the contrary, the first 

c^ ■ structures to form in a cosmological region evolve into relatively typical objects 

at later times. We propose alternative numerical methods for simulating the 

earliest structures in cosmological volumes. 

Subject headings: cosmology: theory - galaxies: high-redshift - early universe - 
methods: N-body simulations 



1. Introduction 

Rare dark matter halos at high redshifts host interesting astrophysical objects, especially 
before or at the end of the reionization epoch. One example is given by the very first Popula- 
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tion III (PopIII) stars formed in the universe at z > 40, which started the metal enrichment 



of the interstellar medium and the reioniz ation process (jAbel et al.ll2002l : ISantos et al.ll2002 



Bromm fc Larson! |2004J : iNaoz et al.l l2006l ). and possibly also produced intermediate mass 
black hole seeds that grow to become s uper-massive black h oles {Mbh > IO^Mq) within 
the first billi on year after th e Big Bang (jVolonteri et al.ll2003l ). Another example are bright 



6 QSOs flFan et al. 



at that time (ISpringel et al. 



2004) 



, cons idered to be hosted in the most massive dark matter halos 



20051 ). The luminosit y of such an object is powered through 



accretion onto a supermassive black hole (e.g. see iHopkins et al.ll2005l ). which may be the 
descendant of one of the first generation of PopIII st ars formed in the universe, at 2; > 40 
dMadau fc Ree^boOll : see also lXrenti fc Stiavellil [2007[ ) . 



Many numerical and theoret ical investigations have aimed at characterizing the proper 



ties of both PopIII stars (e.g. see 



2005: Reed et al. 



12005 



Gao et al. 



Hopkins et al. 



20051: 



200 



a 



Abel et al. 20021: Bromm fc Larsonll2004j : ICiardi &: Ferrara 



O'Shea fc Norman 



Springel et al. 



2005 



2OO7I) and of high redshift QSO 



Pi Matteo et al.l l2005l : iLi et all 120071 1. 



(e.g. see 

However, the rarity of these structures makes a fully self-consistent treatment even of the 
formation of the underlying dark matter halos typically outside the current capabilities of a 
standard cosmological simulation, as the dynamic range that needs to be resolved is too large. 
The main limitation at high mass resolution is the box size, Ibox'- The enforcement of peri- 
odic boundary conditions limits the power spectrum of density fluctuations to modes with 
wavelengths shorter than Ibox- This results in a severe bias on the measured number density 
of rare, massive halos, especially before the epoch of the reionization of the universe when 
the abundance of galaxies derived from numerical simulations can be underestimated by up 
to an order of magnitude (JBarkana fc Loebll2004l ). While the number density of rare ob 



iects can be estima t ed in the context of Pr e ss-Schechter-type mode ling (JLacey fc Colelll993 



Mo fc White! Il996l : ISheth fc TormenI Il999l : iBarkana fc Loebll2004[ ). studying the details of 
the formation histories of the first galaxies and QSOs, possibly including hydrodynamics 
and radiative feedback processes, requires high-resolution numerical simulation. 

Thus, given a simulation box that is large enough to contain one or more of the high- 
redshift halos of interest, the problem is to identify a sub-region that contains one of them 
for high-resolution simulation. In passing we note that this challenge is different from that of 
developing a simulation code that is able to adaptively increase the resolution when follow- 
ing the gas collapse that leads to the formation of the first stars. This ha s been successfully 
imple r aented with adap tive mesh refinement codes (for example ENZO - iBryan fc Norman 
19981 ). iGao et al.l (120051 ) proposed a method to identify rare structures formed at very high 
redshift by recursively resimulating successively smaller, nested, sub-regions of a simulation 
box at progressively higher resolutions. Specifically, their method resimulates the region of 
the box centered on the most massive dark matter halo, reidentified at higher redshift and 
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lower mass within the sub-region at each resimulation step, until the desired resolution is 
achieved in a small fraction of the entire simulation box. This procedure is well-motivated 
by the f act that number density of massive halos is increased in regions of large-scale over- 
density (JBarkana fc Loebll2004l ). and thus structure formation in the sub-region on small 
scales at early times is accelerated by sitting at the top of an over-density, an over-density 
that is known to exist because it collap sed into the massive halo identified in the previous 
resimulation step. The iGao et al.l (|2005|) method extends high-resolution resimula tion tech- 



niques used previously (e.g. see iNavarro fc White! Il994l : IWhite fc Springe 



2000h. and has 



been used recently by several au t hors to study the first stars and QSOs (JGao et al 
Reed et al.l[2005l : lGao etaDboO?! : E7et al.ll2007h . 



2005 



The recursive resimulation method for volume selection is very effective at identifying a 
region with a very high-redshift halo when compared to random selection of a region of equal 
volume within the parent simulation box, but still it does not lead to the rarest high-redshift 
halos in the volume. This is due to the stochastic nature of the formation and growth o f dark 
matter halos (e.g. see lPress fc Schechterlll974l : lBond et al.lll99ll : ISheth fc Tormenll 19991 ). The 
most massive dark matter halo in a given volume at some early time does not in general 
evolve into the most massive halo at a later time. The pr obability of th e most massive halo 
to evolve into the most massive halo, quantified following iLacey fc Cold (119931 ) . decreases as 
time passes and the characteristic mass scale increases. As we demonstrate, the probability 
falls far below unity for the evolution of the first stars and QSOs to the present time. The 
stochastic natur e of growth histories of dark matter halos is addressed and discussed in 
Gao et al.l (120051 ) . but unfortunately some of the subsequent studies using their method do 
not take into account this stochasticity and assume a very strong correlation between the 
location of the richest clusters ai z = and that of rare halos at very high redshift. 

In this paper we clarify this issue by presenting some basic, but often overlooked, results 
of Gaussian random fields in extended Press-Schechter theory to quantify the locations at 
later times of the first dark matter halos to form in a simulation box. We compare these 
analytical and numerical results to those published using recursive resimulation and highlight 
the benefits and limitations of that method. In addition, we propose an alternative method, 
based on analysis of the density field in the initial conditions, to select the sub-region of a 
simulation box that contains some of the earliest structure in the box without being biased 
toward the regions hosting the largest halos ai z = 0. The paper is organized as follows. In 
Sec. [2] we investigate where the descendants oi z = 6 QSOs are today, while in Sec. [3] we 
extend the result to the first Population III stars in the Universe. In Sec. H] we discuss the 
implications of our results for the recursive mesh refinement method and propose a viable 
alternative. Sec. [5] summarizes and concludes. 
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2. Bright z ^ 6 QSOs: where are they today? 

As a first step toward characterizing the assembly history of rare, massive, dark matter 
halos, we consider the link betwee n z ^ 6 QSOs (cons idered to be hosted in the largest 



protoclusters at that time, e.g. see ISpringel et al.l l2005l ) and the largest clusters at z = 



For this we use the publicly availa ble merger trees constructed upon the Millennium Run 
(ISpringel et al.ll2005l : lLemsonll2006l ). This cosmological simulation has more than 10^° dark 



matter particles within a volume of 500^ h ^Mpc^ and has been run u sing a concordance 
AC DM cosmology based on WMAP year 1 results (ISpergel et al.l 120031 1^1. 



From the Millennium Catalogs we select all dark matter halos with more than 10'^ 
particles (corresponding to a minimum halo mass of 8.6 ■ 10^^ H'^Mq) in the z = snapshot 
and all halos with more than 100 particles in the z = 6.18 snapshot. Within these catalogs 
we also identify the descendants at z = of the z = 6.18 halos and plot them in Fig. [TJ 
The figure immediately shows that the descendants of the most massive halos at z > 6 live 
in a variety of environments at z = 0. Some of the most massive halos at z = 6.18 have 
accreted relatively little mass by 2; = (a factor of a few of their initial mass), while others 
have increased their masses by more than hundred times, reaching M > 10^^ h~^MQ. The 
scatter is substantial. In particular the descendant of the most massive z = 6.18 halo in this 
simulation has a mass of only 2.2 • 10^^ H^^Mq aX z = 0, to be compared with the largest 
cluster in the box, which has M = 3 ■ 10^^ H^^Mq. Fig. [T] also quantifies the relation between 
progenitor and descendant halos in terms of the dimensionless variable u = 5l/a'^{M) used 
in extended Press-Schechter modeling. Here (j^(M) is the variance of density fluctuations 
over a scale that contains a mass M and 5c{z) is the critical value of an overdensity in linear 
theory at redshift z. 

A complementary picture is given by considering the mass distribution of the most 
massive z = 6.18 progenitor for the 10000 largest z = halos, that is with a mass M > 
4 ■ IO^^H'^Mq (Fig. [2]). Again a considerable scatter is present in the plot, with only a 
modest correlation between descendant and progenitor mass. These results reflect the fact 
that there are additional contributions to the density fluctuation power spectrum over a scale 
Ml compared to a scale M2 > Mi. This can be also illustrated by random walks generated 
using a AC DM power spectrum (see Fig. [3]). The tj^ical fluctuations at small scales greatly 



^Note that a change in the cosmological parameters, and in particular on the value of cg, the root mean 
squared mass fluctuation in a sphere of radius 8 /i^^Mpc extrapolated to z = using linear theory, does 
influence the details of our results, such as the expected mass of a QSO host halo, but not the essence of the 
relation between dark matter halos at different redshifts. 

^http://www. mpa-garching.mpg.de/millennium 
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exceed those at large scale, so that a walk with excess power at small scales often has a 
rather typical amplitude at large scales, and vice versa. 

A quantitative model for this behavior is available in the context of ext ended Press- 



Schechter theory, for example by Eqs. 2.15 and 2.16 in iLacey fc Cold (119931 ). Using this 
formalism one can compute the probability that a dark matter halo of mass Mi at redshift 
zi evolves into a halo of mass M > M2 at Z2 < z\ \P{M > M2, Z2\Mi, zi)]. This is shown in 
Fig. m where we plot the contour lines of P for a Mi = 5 ■ 10^^ h~^MQ halo at zi = 6.18. 
The median of the distribution at 2; = is M = 4 ■ 10^^ h~^MQ and at the 68% confidence 
level interval is M G [1.8 : 8.8] ■ 10^^ h~^MQ. In this respect the Millennium Run is typical as 
it lies within the la interval. From Fig. H] one can also immediately see that it is relatively 
improbable for the descendant of the most massive halo at z = 6.18 to be the most massive 
halo at z = 0. In fact there is only a probability p < 2% that the mass of the descendant 
halo is above 2 ■ 10^^ /i^^Mq. The Millennium Run has a volume large enough to contain 
z = halos more massive than 2 • 10^^ /i'^Mq (in fact there are two of them), thus it is 
expected that in less than 2% of Millennium-like realizations the most massive z ^ 6 halo is 
the progenitor of t he most massive z = h alo. As a reference we provide the number density 



contours from the ISheth fc TormenI (119991 ) mass function, integrated to obtain the number 
of objects above that mass at that redshift in the volume of the Millennium Run (red dotted 
lines in Fig. Hj): when z < 2.5, the upper 2a confidence level contour lies at a lower mass than 
the contour for the number density (n = 1) associated with the most massive dark matter 
halo in the volume. This mean that the typical mass of the descendants progressively shifts 
toward that of more common halos as the redshift decreases, as it is also immediately visible 
when looking at the redshift evolution of the value of u for the median descendant, which 
decreases progressively (small panel of fig. Hj). 



3. The very first PopIII stars: where are they today? 

A similar scenario holds for the relation between the location of the very first PopIII 
stars, formed in dark matter halos with M ^ lO^Mo at ^ > 40, and that of dark matter halos 
at 2; = 0. Here there is even less correlation than in the case of 2; = 6 QSO halos (see Sec. [2]) 
because the halo mass dynamic range involved is larger and thus there is an additional 
contribution to the scatter from modes at small sc ales in the densit y fiuct uation power 



spectrum. Using a simulation volume of ~ 1 Gpc^, iTrenti fc Stiavellil (120071 ) showed that 
the remnants of the very first PopIII stars formed in dark matter halos of mass 10^ H'-^Mq 
at z > 40 end up at z = in dark matter halos with a median mass of 3 ■ 10^^ h~^MQ, about 
two order of magnitude smaller than the largest halo in the simulation box. Here we confirm 
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and extend their result using extended Press-Schechter modeling. 



Using Eq. 2.15 and 2.16 of iLacey fc Cold (119931 ) . we compute at different redshifts the 
probability distribution for the mass of the descendant of a 10^ /^"^Mq dark matter halo 
formed at zi =40. The results are shown in Fig. [5l where we plot the median descendant 
mass versus the redshift Z2 (black line), the la confidence level contours (blue lines) and 2a 
confidence lev el contours (green lines). These analytic results agree well with the numerical 
simulations in lTrenti fc Stiavellil (120071 ) and confirm that at 2; = the typical remnant of one 
of the very first PopIII halos does not live in a supercluster, but is rather hiding in a more 
common group environment. This is, e.g., contrary to the conclusions ( "T he very oldest stars 
should be found today in the central regions of rich galaxy clusters" ) that IWhite &: Springel 
(j2000l ) drew from the resimulation of a massive galaxy cluster. Similarly, a.t z = Q the 
remnant of a typical z > AO PopIII star does not live in the largest dark matter halos of 
that time, but rather has seeded a dark matter halo of mass 3 — 4 ■ 10^'^ h~^MQ, typical for 
the faint z ^ 6 galaxies observed in deep surveys such as the Hubble Space Telescope Ultra 
Deep Field dTrenti fc Stiavellil boosl ). 



These results have been obtained for very rare PopIII stars formed at 2 > 40. Extended 
Press-Schechter modeling predicts that the remnants of more common PopIII stars formed 
at lower redshift live in yet lower mass dark matter halos. For example a 10^ h~^MQ dark 
matter halo formed at z = 20 has a z = descendant with median mass 4 ■ 10^^ h~^MQ. 
More massive, rarer, PopIII halos, with virial temperature above 10^ K, can cool by neutral 
hydrogen rather than molecular hy drogen so that their form ation is less sensitive to radiative 
feedback from other stars (e.g. see lBromm fc Larsonll2004l ). These halos have a typical mass 
of ~ 10^ /i~^Mq and if they form at z = 20, then their typical descendants are similar to 
those of a z = 40 IO^H'^^Mq dark halo (see Fig. [5]). 



4. Generation of Initial Conditions around Rare High-Redshift Halos 

Given the large scatter in the assembly histories of dark matter halos is there an optimal 
way to select a region of a simulation box centered around a rare high-redshift halo under 
the constraint of limited computational resources? 



4.1. Analysis of the recursive resimulation method 



The method introduced by iGao et al.l (120051 ) certainly presents a very competitive ad- 
vantage over a random selection of an equal subvolume though it identifies a halo with an 



-7- 



atj^ical accretion history, biased toward having an above average m erging rate and hying i n 
an environment that tends to have an overdensity of nearby halos (JBarkana fc Loebl l2004j ) . 
To better quantify the properties of halos identifie d by r ecursive resimulation, we use the 
Monte Carlo method presented in lTrenti fc Stiavellil (120071 ). which is based on the identifica- 
tion of virialized dark matter halos as density peaks in the linear density field. This approach 
predicts well and without introducing systematic biases the location and virialization red- 
shift of the very first dark matter halos when compared to the full non-linear dynamics of 
the simulation, even t hough there are some statist ical fluctuations on a halo-to -halo basis 



( Bond &: Myerd Il996l : iMesinger Sz Furlanettd 120071 ) . For the comparison with iGao et al. 
( 120051 ) we adopt the following parameters from their paper: (i) ^m = 0.3, il\ = 0.7, 
Qb = 0.04, h = 0.7, o"8 = 0.9, spectral index Ug = 1; (ii) parent box size edge hox = 479 Mpc 
h~^] (iii) largest dark matter halo in the box at z = of mass Mi = 8.1 ■ 10^'^h~^M 



Oi 
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final high-resolution simulation sphere of 1.25h~^ Mpc (with volume V5 = 8.18h~^ Mpc^ 
(v) the most massive halo in the final resimulation region R5 is Mhaion^ = 1-2 • 10^h~^MQ at 
z = 48.84. Based on these assumptions, our analysis yields the following results: 



The number density of dark matter halos of mass M > Mhaion^ is n{M > MhaioR^) = 
0.17/i3Mpc-3 (Sheth-Tormen mass function) or n{M > MhaionJ = 0m25h^Mpc-^ 
(Press-Schechter mass function) at z = 48.84, so these halos are not particularly rare. 
Specifically, in a random region of volume V5 at z = 48.84, the expectation value for the 
number of halos more massive than Mhr ,j. „^^ is greater than unity {n{M > M^a^o^^)- V5 = 



1.39) when using the lSheth fc TormenI (119991 ) formula. However, due to the high bias 
of the halos, b ~ 16, the actual fract i on of empty volumes as given by our Monte Carlo 
code is 98.7%. Thus the iGao et al.l (120051 ) method is a substantial improvement over 
random selection of a volume V5 within the parent box. 



The first halo of mass MhaioR^ = 1.2 ■ IO^/i^^Mq in the whole box virializes at z > 62 
at 99% of confid ence level. The possibility that the most massive halo identified by 
Gao et al.l (120051 ) in their final resimulated volume is the most massive of the whole 
box at that redshift is ruled out at a confidence level greater than 1 — 10~^ (confidence 
level limited by the precision of the numerical integration) . 

The first halo of mass Mhaion^ that ends up at z = in a dark matter halo of mass 
Ml = 8.1 ■ 10^^/i-^ MfT, is forn i ed at z > 50.8 at 99.9% of confidence level. Thus it 
is unlikely that the JGao et al.l (120051 ) halo of mass MhaioR^ at z = 48.84 is the most 
massive progenitor of the cluster that will contain it at 2; = 0. 



From this analysis it is indeed confirmed that recursive refinement works well to select 
a sub-region for resimulation that host a rare high-z dark matter halo, but still this halo is 



not one of the very first of its kind in the box. If the interest is primarily in the local physics 
around one of these rare s tructures, a viable a lternative is the constrained realization method 



( jHoffman fc Ribald Il99ll : iBertschingerl |2001| ). A constrained realization does not require a 
hierarchy of resimulations, but rather introduces the overdensity in the initial conditions by 
construction, so that the realized initial conditions do not carry information about its rarity 
or its typical surrounding environment. 



4.2. Unbiased selection of rare halos: the density field method 

If one is instead interested in selecting one of the very first dark matter halos in a 
given simulation volume, with a particular interest in representative halos, that is halos with 
unbiased accretion histories, we propose instead to select the final region of interest from 
a density field at uniform resolution. When the dynamic range of the resimulation is not 
too large, it is of course possible to identify the region to be refined directly from the low- 
resolution dark matter halo catalogs. If this is not possible, then the resimulation volume 
can be identified from a high resolution, but uniform, refinement of the density field. The 
idea is the following: 

1. A high- resolution density field is generated over the whole box. The mass of a field cell 
is that of the high-z dark matter halo of interest (e.g. 10^h~^MQ for a PopIII halo). 
A technical, but important, detail is that a top-hat filter in frequency space must be 
used when generating this field, as this guarantees that the mass function obtained 



from peak analysis matches that of iPress fc Schechterl (jl974j ) 



2. The highest peak in the high-resolution field is identified. This is the location where 
one of the first halos at the mass scale of the density field is formed within the whole 
simulation box. 

3. A region around that peak can now be selected for resimulation, with additional fluctu- 
ations added over the high resolution dens ity field, for example by using the GRAFIC2 



refinement method of IBertschingerl (120011 ). Outside the selected region the high reso- 



lution field can be degraded if necessary to generate more massive particles. 

This method does not necessarily require overwhelming computational resources. For 
example, to select the largest z = 6 dark matter halo in a volume relevant for the study of 
bright high-2; QSOs, a mass resolution of ~ 5 x lO^^/i^^Mo is more than sufficient, which 
translates into a A^ = 512^ density field grid for a cosmic volume of ~ {IGpc/h)^. Such 
a grid only requires 512MB of RAM for storing. Even identifying the positions of the first 
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PopIII stars in a large volume is not unrealistic. A A^ = 2048^ density grid requires only 
32 GB of RAM and provides a mass resolution of 1.2 x lO^/i~^M0 = Mhaioaa (the first halo 



m 



Gao et al.l l2005l ) over a volume of (23.74/i ^Mpc)'^. Given the number density of such 
halos — n ^ 0.11/i^Mpc~^ — about 1471 will have formed on avera ge in the box hj z = 48.84. 



In fact, using the Monte Carlo code of iTrenti fc Stiavellil (120071 ) we predict that there is a 
probability p < 10~^ of not forming one of these halos before z = 48.84. The median redshift 
of formation for the first halo of mass MhaioRs is -2 = 51.6. With larger resources one can 
generate a 4096^ density grid over a volume of (100Mpc//i)^, obtaining a mass resolution 
of lO^/i~^M0. This requires about 256GB of RAM, easily available on a parallel computing 
cluster of moderate size. With a top-end supercomputer one can use more than 10^^ cells, 
with a dynamic range that permits identification of smaller mass halos in the same volume 
or an increase the volume of the parent box for the same assumed PopIII halo mass. 

Our proposed density field method has also another advantage: As we generate a uni- 
form, very high-resolution density grid over the whole box, aliasing erro rs in the zoom refine- 
ment proced ure are less severe t han if one follows the iGao et al.l (120051 ) recursive refinement 



method (see iBertschingerl l200ll for a detailed analysis of the errors introduced when large 



zooming factors are used). 



4.3. Preliminary testing of the density field method 

Detailed analysis and applications of our method to select high-redshift halos is deferred 
to a follow-up paper. Here we present a preliminary testing to assess its performance. For 
this we consider a A^ = 512^ cosmological simulation in a box of edge li,ox = 512/i~^Mpc 
(details on the simulation are discussed in iTrenti fc Stiavellil I2007D. We construct the halo 
catalog for a snapshot at z = 5 using the HOP halo finder (JEisenstein &: Hutlll998l ). finding 
that the most massive halo has a mass of 1.25 ■ lO^^/i~^M0 (187 particles). The density 
field used to generate the initial conditions for this A^ = 512'^ run is then (i) downgraded 
to a Ndown = 12 8^ grid (where one l ow re solution cell corresponds to 64 of the original grid 
cells) using the iHoffman fc RibakI (Il99ll ) constrained realization method and (ii) evolved 
in linear theory to 2; = 5. The highest density peaks in the low-resolution field are then 
identified and their location compared with that of the dark matter halos identified from the 
N-body simulation at full resolution. Within the ten highest peaks (with linear overdensities 
from 5 = 1.84 to 5 = 1.75), six of them are associated with dark matter halos with more 
than 64 particles, including all the top three overdensities. The top overdensity is matched 
to the fourth most massive halo of the snapshot, with 141 particles (versus the 187 of the 
most massive). Of the remaining unmatched peaks, two are associated with halos with less 
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than 64 particles and two appear to be still in the process of virialization, so the central 
density of their groups in the N-body run does not qualify them as halos. The most massive 
halo in the box is missed by the density field analysis because its particles are spread over 
several adjacent cells in the downgraded density field. This is an intrinsic limitation of our 
method rooted in the use of a fixed grid in the position space, which is bound to miss some 
of the the dark matter halos that are off-center with respect to the spatial location of the 
grid cells. However this onl y leads to miss a random fraction of the rare halos, without an 
environmental bias as in the iGao et al.l (120051 ) method. 



5. Conclusion and Discussion 



By means of both analysis of numerical simulations and of extended Press-Schechter 
modeling we investigated the relation between the most massive dark matter halos at different 
redshifts. The main conclusion of this work is that— contrary to expectations from many 



recen t works (e.g. see ISpringel et al.ll2005l : iReed et al.ll2005l : iGao et al.ll2005l . 120071 : iLi et al. 



20071 ) — the most massive halo at a redshift zi > Z2 does not necessarily evolve into the 
most massive at ^ = Z2. This is a robust conclusion that can be naturally understood in 
the context of growth of dark matter density perturbations, for example by constructing 
merger trees through the iLacey fc Cold (119931 ) model. Rare high-redshift objects, such as 
the remnants of the first PopIII stars and QSOs, are not hosted at z = in the most massive 
halos, but rather live in a variety of environments. For example, the typical 2; > 40 PopIII 
star remnant lives in a dark matter halo that at 2 = has a mass of ;^ 2 ■ lO^^/i'-^Mcr^ , typica l 
for a galaxy group, and not within rich clusters as claimed by IWhite fc Springell (120001 ). 
Similarly the descendant of a t ypical z " ^ 'o Q SO is not located within the most massive 
clusters at 2; = as assumed by iLi et al.l (120071 ) . 



These conclusions have ir nportant consequ ences on the application of the recursive sim- 
ulation method introduced by ICao et al.l (120051 ) to identify high-redshift rare halos based on 
progressive refinement of regions centered around the most massive z = cluster. In fact, 
while the recursive method is indeed effective at identifying a sub-region of the simulation 
with earlier-than-average structure formation, it finds neither the earliest structures in the 
box, which are dominated by small-scale density fluctuations, nor typical early structure, as 
it preferentially identifies objects located in the regions with the highest bias. 

These limitations may have only a minor effect when the goal is to investigate the 
fo rmation of one ra re Popu lation III h a lo in the simulation box, as it is done for example 
in iCao et al.l (12005! ) and in JReed et al.l (120051 ) . However in different physical scenarios it is 



important to correctly estimate the rarity of the halo simulated and to assess how typical their 



-11- 



growth histories are. This is critical if additional physics beyond gravitational interactions is 
included, such as star format ion and radiativ e feedba ck. One such exam ple is the formation 



of rare high-redshift QSOs: iLi et al.l (120071 ) use the iGao et al.l (120051 ) method to identify 
at z ~ 6.5 "the most massive halo in a ~ 3 Gpc^ volume" and then conclude that the 
QSO formed in this halo reproduces the properties of observed QSOs wit h the same nuni ber 



density. From our analysis in Sec. [2] it is clear that the halo identified by iLi et al.l (120071 ) as 
progenitor of the largest 2; = cluster is not likely to be the most massive at z ~ 6.5. Thus 
other similar or more massive halos are expected to be present at 2 ~ 6.5 in their ?^ 3 Gpc^ 
simulation volume: in principle any of these halos could host a bright QSOs, with important 
consequences for the comparison between observed and simulated QSO number densities. 
In addition, when the goal is to study the environment in which QSOs live, selecting host 
halos with the resimulation method introduces systematic effects in the results difficult to 
quantify and correct for, because these halos would have above-average growth (and merging) 
histories. 

To avoid selecting only the halos with atypical accretion histories and in an attempt 
to improve over the identification of some of the rarest high-redshift halos in a box, we 
propose instead to select the initial conditions for high-resolution resimulation based on 
the analysis of the linear density field at uniform resolution. The method, described in 
Sec. m identifies subregions of the simulation box with high-redshift halos as those with 
the highest peaks in the density field, requiring a mass resolution in the field comparable 
to that of the mass of the halos one wishes to select. The applicability of the method is 
thus limited only by the size of the largest density grid that can be accommodated in the 
availa ble memory , othe rwise requiring only a modest amount of computing time compared 



to the iGao et al.l (120051 ) method. This is because our method bypasses the need of a series 



of N-body simulations to be carried out in addition to the final run. 

Unfortunately, our method does not guarantee identification of the first halo on the 
desired mass scale, as highlighted by some preliminary testing we presented in Sec. 14.31 This 
seems still an elusive goal. When the density field is defined over a fixed grid, there is not a 
perfect match between the halo catalog constructed from the density field and that obtained 
by increasing the resolution of the field and then following the full non-linear dynamics with 
an N-body simulation. An extensive validation of our linear density field initial conditions 
generation and its application to the formation of the first bright QSOs will be discussed in 
a subsequent paper. 

This work was supported in part by NASA grants JWST IDS NAG5-12458 and HST- 
GO10632. We thank the referee for useful comments and suggestions. We are grateful to 
Gerard Lemson and to the Millennium Run collaboration for allowing us to use their merger 
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Fig. 1. — Mass at z = of the descendants of the most massive dark matter halos at ^ = 6.18 



in the Millennium Run (jSpringel et al.ll2005l ). The shaded area within the region with the 
highest density of points represents for a given z = 6.18 mass the 68% confidenence level 
interval for the mass of the z = descendant, while the blue central line is the median of 
the distribution. Some of the most massive halos at 2; > 6 have accreted only relatively 
little mass and a considerable scatter is present. The upper and right axes in the plot 
translate the halo mass into the dimensionless variable u = 6'^/a'^{M), used in the extended 
Press-Schechter formalism. 
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Fig. 2. — Mass at redshift z = 6.1 8 of the most massive progenitor of the most massive halos 
at z = for the Millennium Run (jSpringel et al.ll2005l ). i> values are given as in Fig. [H The 
few points with 77^2=6. is = 8-5 • 10^h~^MQ are associated to those z = halos that do not 
have a 2; = 6.18 progenitor identified in the simulation (that is its below the minimum halo 
mass of 8.5 ■ IO^^H'^Mq — corresponding to 100 particles). Given the considerable scatter 
in these plots, adaptive refinement of the most massive z = halo in a simulation would not 
necessarily lead to the most massive z ^ 6 halo of that box. 



-16- 



log,,(M h/M_) 

10 8 

T 



30 - 



o 




100 



o^U) 



Fig. 3. — Random walks generated using a Millennium run cosmology. The green shaded 
area represents for any value of o"^(M) — evaluated at 2; = — the area enclosing 68% of the 
walks and thus has amplitude ±cr(M). The red and blue walks have been selected from 10^ 
random realizations as (i) red: the walk with the largest overdensity at M = 3 ■ lO^/i'^M© 
and (ii) blue: the walk with the largest overdensity at M = 2 ■ lO^^/i'^M©. 
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Fig. 4. — Main panel: Contours for the probability that a halo of mass Mi = 5 ■ lO^^/i~^M0 at 
redshift Zi = 6.18 is more massive than M2 at redshift Z2 < Zi [P{M > M2, Z2\Mi, zi)]. The 
black line is the median of the distribution, the blue lines the la confidence levels and the 
green lines the 2a confidence levels. Red dotted lines: contour lines for the number density of 
dark matter halos in a 10^(Mpc//i)^ volume (1,10,100,1000,10000 halos from top to bottom), 
obtained using the Sheth & Tormen mass function. Small inset: Evolution of the main panel 
median line represented in terms of the extended Press-Schechter variable u. From this inset 
it is immediately clear that the descendant of a rare density peak progressively becomes 



more common. 
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Fig. 5. — As in fig. H] but for a dark matter lialo of mass Mi = lO^h ^M© formed at zi = 40. 



